Abstract

We develop an asymptotic analysis of some statistical properties of the random binary-tree
model. The hierarchical structure of branching patterns is quantified by the
Horton-Strahler analysis. We introduce a transformation of a binary tree and derive a
recursive equation for branch orders. As an application of the analysis, topological
self-similarity and its generalization are proved in an asymptotic sense. Some important
examples are also presented.

1 Introduction

Branching patterns are ubiquitous in nature; examples include river networks, blood vessels, and
dendritic crystals [1,2]. They usually exhibit intricate forms (some patterns have been treated
as fractal or multifractal objects). Branching structures are also fundamental tools
for representing data structures in computer science [3] and the classification of species
in taxonomy [4]. For the analysis of branching patterns, topological (or graph-theoretic)
properties are as important as geometrical ones, and even the topological structure alone is
complicated.

The topological structure of a branching pattern is expressed by a binary tree if the pattern
is loopless and all the branching points are two-pronged. A binary tree can be regarded as
a nested structure of parent-child relations between nodes (see Fig. 1). In order to derive
quantitative characteristics of binary trees, Horton [5] introduced the idea of branch
ordering. For mathematical convenience, Horton's method was refined by Strahler [6].
The basic idea of their methods is to assign numbers, referred to as Horton-Strahler indices,
to the nodes of a binary tree.

Figure 1: A binary tree of magnitude 6. Nodes are represented by open circles, and the numbers
on them are the corresponding Horton-Strahler indices.

Horton-Strahler ordering for a binary tree is defined recursively as follows (see also Fig.
1): (i) each leaf is assigned the order 1, (ii) a node whose children are both of order $r$ is
assigned $r + 1$, (iii) a node whose children are of orders $r_1$ and $r_2$ ($r_1 \neq r_2$) is
assigned $\max\{r_1, r_2\}$. In a binary tree, an $r$th branch is defined as a maximal path
connecting nodes of order $r$. The ratio of the numbers of branches of two successive orders is
called the bifurcation ratio. It has been revealed that the bifurcation ratios are almost
constant across orders in some actual branching patterns [6-11], a property referred to as
topological self-similarity [12]. As a typical instance, many river networks have bifurcation
ratios between 3 and 5 irrespective of order [5, 12]. The relation between the two types of
self-similarity, 'original' self-similarity and topological self-similarity, has been
considered in ramification analysis [13,14,17,18].
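The recursive definition above translates directly into code. The following Python sketch is not part of the original analysis; it assumes a hypothetical encoding in which a leaf is the empty tuple `()` and an internal node is a pair `(left, right)`. It relies on the observation that every branch of order $r \ge 2$ starts at a node whose two children have the same order $r - 1$, so counting such nodes counts the branches.

```python
LEAF = ()  # assumed encoding: leaf = (), internal node = (left, right)


def strahler(tree):
    """Return the Horton-Strahler order of the root of `tree`."""
    if tree == LEAF:
        return 1                      # rule (i)
    r1, r2 = strahler(tree[0]), strahler(tree[1])
    if r1 == r2:
        return r1 + 1                 # rule (ii)
    return max(r1, r2)                # rule (iii)


def branch_counts(tree):
    """Return a dict {r: number of r-th branches} for `tree`."""
    counts = {}

    def visit(node):
        if node == LEAF:
            counts[1] = counts.get(1, 0) + 1   # each leaf starts a first-order branch
            return 1
        r1, r2 = visit(node[0]), visit(node[1])
        if r1 == r2:
            # two branches of equal order meet: a branch of order r1 + 1 starts here
            counts[r1 + 1] = counts.get(r1 + 1, 0) + 1
            return r1 + 1
        return max(r1, r2)            # the higher-order branch simply continues

    visit(tree)
    return counts


# A small illustrative tree of magnitude 5 (not the tree of Fig. 1).
T = ((LEAF, LEAF), ((LEAF, LEAF), LEAF))
counts = branch_counts(T)
print(strahler(T), counts)
# Ratios of branch numbers between successive orders for this single tree.
print(counts[1] / counts[2], counts[2] / counts[3])
```

For a single tree these ratios fluctuate strongly; the bifurcation ratio studied in this paper is defined through averages over all trees of a given magnitude.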

The number of leaves of a binary tree is called its magnitude, and let $\Omega_n$ denote the set
of topologically different binary trees of magnitude $n$. The number of elements of $\Omega_n$
is given by

$$\#\Omega_n = \frac{(2n-2)!}{n!\,(n-1)!} = c_{n-1}, \tag{1}$$

where $c_{n-1}$ is the $(n-1)$th Catalan number [19]. One of the simplest models of a branching
structure is the random model [20], where all the binary trees in $\Omega_n$ emerge with equal
probability. More precisely, the random model is a probability space $(\Omega_n, P)$, where $P$
is the uniform probability measure on $\Omega_n$, i.e., every binary tree $T \in \Omega_n$ has
the same statistical weight $1/c_{n-1}$. We denote by $E_n[\cdot]$ the average over $\Omega_n$.
We introduce a random variable $S_{r,n} : \Omega_n \to \mathbb{N} \cup \{0\}$
such that $S_{r,n}(T)$ represents the number of $r$th branches in a binary tree $T \in \Omega_n$.
The $r$th bifurcation ratio $R_{r,n}$ on $(\Omega_n, P)$ is defined as

$$R_{r,n} = \frac{E_n[S_{r,n}]}{E_n[S_{r+1,n}]},$$

and topological self-similarity has been confirmed in the case where the magnitude $n$ is
sufficiently large. In fact, Moon [21] has derived

$$E_n[S_{r,n}] = \frac{n}{4^{r-1}} + \frac{1 - 4^{1-r}}{6} + O(n^{-1}),$$

and

$$R_{r,n} = 4 - \frac{4^r}{2n} + O(n^{-2}) \to 4 \quad (n \to \infty). \tag{2}$$

Therefore, the random model is topologically self-similar in an asymptotic sense, and the limit
value of $R_{r,n}$ is 4. Moreover, the present authors [22] have derived

$$\frac{E_n[S_{r,n}^k]}{E_n[S_{r+1,n}^k]} = 4^k - \frac{4^{k+r-1} k^2}{2n} + O(n^{-2}) \to 4^k \quad (n \to \infty), \tag{3}$$

and this relation can be regarded as a generalization of Eq. (2). Other results on the
Horton-Strahler analysis and tree structures are found in Refs. [24-31].

In the present paper, we focus on a random variable $f(S_{r,n})$, where
$f : \mathbb{N} \cup \{0\} \to \mathbb{R}$ (or $\mathbb{C}$) is a certain function (further
assumptions on $f$ are stated later). We first derive a recursive relation between
$E_n[f(S_{r,n})]$ and $E_m[f(S_{r-1,m})]$. Then, we derive the asymptotic form of
$E_n[f(S_{r,n})]$ and show topological self-similarity about $f$ (or simply generalized
topological self-similarity), in the sense that the $f$-bifurcation ratio

$$R^f_{r,n} = \frac{E_n[f(S_{r,n})]}{E_n[f(S_{r+1,n})]}$$

is asymptotically independent of $r$. Clearly, $R^f_{r,n}$ reduces to $R_{r,n}$ when $f(x) = x$.



2 Transformation of binary tree 

First, we introduce a transformation $\Phi_n$. For a binary tree $T \in \Omega_n$, a new binary
tree $\Phi_n(T)$ is constructed by the following two steps: (i) remove all the leaves from $T$,
(ii) if a node with only one child appears, merge that node with its child (this operation is
called contraction in graph theory). Figure 2 illustrates these two steps. The magnitude of
$\Phi_n(T)$ is at most $\lfloor n/2 \rfloor$, because a pair of first-order branches is needed to
create a second-order branch. Hence,

$$\Phi_n : \Omega_n \to \bigcup_{m=1}^{\lfloor n/2 \rfloor} \Omega_m.$$

Figure 2: An illustration of $\Phi_n$ for $n = 12$: (i) removal of the leaves of $T$, (ii) contraction.

We introduce $\Omega_n^m = \Phi_n^{-1}(\Omega_m)$, which is explicitly expressed as
$\Omega_n^m = \{T \in \Omega_n ;\ S_{2,n}(T) = m\}$. The family $\{\Omega_n^m\}_m$ is a partition
of $\Omega_n$, that is,

$$\Omega_n = \bigsqcup_{m=1}^{\lfloor n/2 \rfloor} \Omega_n^m.$$

By definition, we have $S_{r-1,m}(\Phi_n(T)) = S_{r,n}(T)$ if $T \in \Omega_n^m$, and we regard
this as a relation connecting variables of two successive orders ($r$th and $(r-1)$th). For
example, for the binary tree $T$ in Fig. 2 ($n = 12$, $m = 4$), we can easily check the
following relations:

$$S_{1,m}(\Phi_n(T)) = 4 = S_{2,n}(T), \quad S_{2,m}(\Phi_n(T)) = 2 = S_{3,n}(T), \quad S_{3,m}(\Phi_n(T)) = 1 = S_{4,n}(T).$$
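The two steps of the transformation and the order-shifting relation can be tried out on a small tree. The Python sketch below is an illustration under assumptions that are not in the original text: a leaf is encoded as `()`, an internal node as `(left, right)`, and the helper names `phi`, `branch_counts`, and `magnitude` are hypothetical. The example tree has magnitude 8 and is not the tree of Fig. 2.

```python
LEAF = ()  # assumed encoding: leaf = (), internal node = (left, right)


def phi(tree):
    """Remove all leaves and contract one-child nodes; return None if nothing remains."""
    if tree == LEAF:
        return None                       # step (i): leaves are removed
    left, right = phi(tree[0]), phi(tree[1])
    if left is None and right is None:
        return LEAF                       # both children were leaves: node becomes a leaf
    if left is None:
        return right                      # step (ii): contract a node with one child
    if right is None:
        return left
    return (left, right)


def branch_counts(tree):
    """Return {r: number of r-th branches}; a branch of order r >= 2 starts
    where two children of equal order r - 1 meet."""
    counts = {}

    def visit(node):
        if node == LEAF:
            counts[1] = counts.get(1, 0) + 1
            return 1
        r1, r2 = visit(node[0]), visit(node[1])
        if r1 == r2:
            counts[r1 + 1] = counts.get(r1 + 1, 0) + 1
            return r1 + 1
        return max(r1, r2)

    visit(tree)
    return counts


def magnitude(tree):
    """Number of leaves of `tree`."""
    return 1 if tree == LEAF else magnitude(tree[0]) + magnitude(tree[1])


T = (((LEAF, LEAF), LEAF), ((LEAF, LEAF), (LEAF, (LEAF, LEAF))))
image = phi(T)
print(magnitude(T), branch_counts(T))
print(magnitude(image), branch_counts(image))
```

The branch counts of the image are those of `T` with every order lowered by one, and the magnitude of the image equals the number of second-order branches of `T`.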

Note that the restriction $\Phi_n|_{\Omega_n^m} : \Omega_n^m \to \Omega_m$ is not one-to-one
(see Fig. 3 for example). Then, for a binary tree $\tau \in \Omega_m$
($1 \le m \le \lfloor n/2 \rfloor$), we introduce the multiplicity

$$\mu_n^m(\tau) = \#\{T \in \Omega_n \mid \Phi_n(T) = \tau\} = \#\Phi_n^{-1}(\tau).$$

Figure 3: Some binary trees are mapped to the identical binary tree by $\Phi_n$.
The multiplicity in this case is $\mu_n^m = 6$ ($n = 5$, $m = 2$).

In order to calculate $\mu_n^m(\tau)$, we trace the inverse process $\Phi_n^{-1}(\tau)$. As
mentioned above, $\Phi_n$ is a removal of the leaves of a binary tree, and the multiplicity is
concerned with the contraction process. Thus, the inverse process $\Phi_n^{-1}(\tau)$ can be
formed by attaching $n$ leaves to $\tau$ in the following way.

(i) A pair of nodes is attached to each leaf of $\tau$.

(ii) $n - 2m$ 'intermediate' nodes are added in the form of chains, which is the inverse of
contraction. The number of different ways of adding them amounts to $\binom{n-2}{n-2m}$.

(iii) $n - 2m$ leaves are attached to the nodes added in (ii). Each leaf can be attached
independently from either left or right, thus the total number of ways of choosing sides is
given by $2^{n-2m}$.

A series of procedures is presented in Fig. 4. From these steps, the total multiplicity
$\mu_n^m(\tau)$ is calculated as

$$\mu_n^m(\tau) = \binom{n-2}{n-2m}\, 2^{n-2m}, \tag{4}$$

which depends only on $n$ and $m$, and not on $\tau \in \Omega_m$. Therefore, we hereafter write
$\mu_n^m = \mu_n^m(\tau)$ without confusion. The connection between $\Phi_n$, $\Omega_n^m$, and
$\mu_n^m$ is depicted in Fig. 5. From the figure, we can derive the following relation:

$$\#\Omega_n^m = \mu_n^m\, \#\Omega_m. \tag{5}$$

By Eqs. (1), (4), and (5), the number of elements of $\Omega_n^m$ is expressed as

$$\#\Omega_n^m = \binom{n-2}{n-2m}\, 2^{n-2m}\, \frac{(2m-2)!}{m!\,(m-1)!} = \frac{(n-2)!\, 2^{n-2m}}{(n-2m)!\, m!\,(m-1)!}.$$
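Because the sets $\Omega_n^m$ partition $\Omega_n$, their sizes must add up to the Catalan number $c_{n-1}$. The short Python sketch below checks this consistency condition for small $n$ using only the closed forms (1), (4), and (5); the function names are hypothetical and chosen for this illustration.

```python
from math import comb


def catalan(j):
    """j-th Catalan number c_j."""
    return comb(2 * j, j) // (j + 1)


def multiplicity(n, m):
    """mu_n^m of Eq. (4): number of trees in Omega_n mapped onto one tree of Omega_m."""
    return comb(n - 2, n - 2 * m) * 2 ** (n - 2 * m)


def size_omega_nm(n, m):
    """Number of trees of magnitude n with m second-order branches, via Eq. (5)."""
    return multiplicity(n, m) * catalan(m - 1)


print(multiplicity(5, 2))  # the case illustrated in Fig. 3
for n in range(2, 13):
    total = sum(size_omega_nm(n, m) for m in range(1, n // 2 + 1))
    print(n, total, catalan(n - 1))  # the last two columns should agree
```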
Figure 4: An example of $\Phi_n^{-1}(\tau)$ for $n = 11$ and $m = 3$: (a) an initial binary tree
$\tau \in \Omega_m$, (b) a pair of nodes is attached to each leaf of $\tau$ (indicated by the
dashed lines), (c) intermediate nodes (black nodes in the figure) are added, (d) new leaves
(hatched nodes) are attached to the intermediate nodes, and (e) generated binary tree.

Figure 5: Schematic illustration of $\Phi_n$, $\Omega_n^m$, and $\mu_n^m$. (a) $\{\Omega_n^m\}_m$
is a partition of $\Omega_n$, and $\Phi_n(\Omega_n^m) = \Omega_m$. (b) The open and solid circles
represent individual binary trees (elements of $\Omega_n^m$ and $\Omega_m$, respectively).
Vertically aligned points in $\Omega_n^m$ are mapped to the identical point in $\Omega_m$.
Therefore, the average of the random variable $f(S_{r,n})$ is expressed as

$$
\begin{aligned}
E_n[f(S_{r,n})] &= \frac{1}{c_{n-1}} \sum_{m=1}^{\lfloor n/2 \rfloor} \sum_{T \in \Omega_n^m} f(S_{r,n})(T) \\
&= \frac{1}{c_{n-1}} \sum_{m=1}^{\lfloor n/2 \rfloor} \sum_{T \in \Omega_n^m} f(S_{r-1,m})(\Phi_n(T)) \\
&= \frac{1}{c_{n-1}} \sum_{m=1}^{\lfloor n/2 \rfloor} \mu_n^m \sum_{\tau \in \Omega_m} f(S_{r-1,m})(\tau) \\
&= \frac{1}{c_{n-1}} \sum_{m=1}^{\lfloor n/2 \rfloor} \mu_n^m\, c_{m-1}\, E_m[f(S_{r-1,m})] \\
&= \frac{n!\,(n-1)!\,(n-2)!}{(2n-2)!} \sum_{m=1}^{\lfloor n/2 \rfloor} \frac{2^{n-2m}}{(n-2m)!\, m!\,(m-1)!}\, E_m[f(S_{r-1,m})].
\end{aligned} \tag{6}
$$

Eq. (6) is a recursive relation between the $r$th variable $f(S_{r,n})$ and the $(r-1)$th
variable $f(S_{r-1,m})$. The present authors [22] have derived a similar recursive equation for
$E_n[S_{r,n}^k]$ and $E_m[S_{r-1,m}^k]$. Compared with the former result, Eq. (6) is more general
and its derivation is much easier. Yekutieli and Mandelbrot [24] have shown that the value

$$\frac{n!\,(n-1)!\,(n-2)!\, 2^{n-2m}}{(2n-2)!\,(n-2m)!\, m!\,(m-1)!}$$

is the probability of finding a binary tree of magnitude $n$ with $m$ branches of order 2.
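Recursion (6) can be evaluated exactly with rational arithmetic. The following Python sketch is an illustration rather than the authors' procedure; the names `prob_m` and `expectation` are hypothetical, and the treatment of $n = 1$ (where the sum over $m$ is empty) is an assumption added here so that the recursion terminates with $f(0)$.

```python
from fractions import Fraction
from functools import lru_cache
from math import comb


def catalan(j):
    """j-th Catalan number c_j."""
    return comb(2 * j, j) // (j + 1)


@lru_cache(maxsize=None)
def prob_m(n, m):
    """Probability that a uniform tree of magnitude n has m second-order branches."""
    count = comb(n - 2, n - 2 * m) * 2 ** (n - 2 * m) * catalan(m - 1)
    return Fraction(count, catalan(n - 1))


def expectation(f, r, n):
    """E_n[f(S_{r,n})] from recursion (6), starting from E_n[f(S_{1,n})] = f(n)."""
    if r == 1:
        return Fraction(f(n))
    if n == 1:
        # Assumption added for this sketch: a single leaf has no branch of
        # order >= 2, so S_{r,1} = 0 and the average is f(0).
        return Fraction(f(0))
    return sum(prob_m(n, m) * expectation(f, r - 1, m) for m in range(1, n // 2 + 1))


print(expectation(lambda x: x, 2, 5))       # mean number of second-order branches, n = 5
print(expectation(lambda x: x * x, 3, 12))  # second moment of S_{3,12}
```

For $f(x) = x$ and $r = 2$ the result can be compared with the closed form $n(n-1)/(2(2n-3))$ of Werner [25] quoted in Sec. 4.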

3 Asymptotic expansion of $E_n[f(S_{r,n})]$

In this section, we derive the asymptotic form of $E_n[f(S_{r,n})]$ by using the recursive
equation (6). Let us assume that the function $f$ has the following expansion:

$$E_n[f(S_{1,n})] = E_n[f(n)] = f(n) = a_1 n^k + b_1 n^{k-1} + O(n^{k-2}). \tag{7}$$

We regard Eq. (7) as the initial condition of the recursive equation (6). We also assume that
$E_n[f(S_{r,n})]$ has the form

$$E_n[f(S_{r,n})] = a_r n^k + b_r n^{k-1} + O(n^{k-2}), \tag{8}$$

where the coefficients $a_r$ and $b_r$ are independent of $n$.

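The present authors [22] have already derived the asymptotic form of the $k$th moment of
$S_{2,n}$ as

$$E_n[S_{2,n}^k] = \frac{1}{c_{n-1}} \sum_{m=1}^{\lfloor n/2 \rfloor} \mu_n^m\, c_{m-1}\, m^k = \left(\frac{n}{4}\right)^k \left(1 + \frac{k^2}{2n}\right) + O(n^{k-2}). \tag{9}$$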



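By substituting Eq. (8) into Eq. (6) and using Eq. (9), the average of $f(S_{r,n})$ can be
calculated as

$$
\begin{aligned}
E_n[f(S_{r,n})] &= \frac{1}{c_{n-1}} \sum_{m=1}^{\lfloor n/2 \rfloor} \mu_n^m\, c_{m-1} \left( a_{r-1} m^k + b_{r-1} m^{k-1} + O(m^{k-2}) \right) \\
&= a_{r-1} E_n[S_{2,n}^k] + b_{r-1} E_n[S_{2,n}^{k-1}] + O(n^{k-2}) \\
&= \frac{a_{r-1}}{4^k}\, n^k + \left( \frac{k^2 a_{r-1}}{2 \cdot 4^k} + \frac{b_{r-1}}{4^{k-1}} \right) n^{k-1} + O(n^{k-2}).
\end{aligned} \tag{10}
$$

Comparing the $O(n^k)$ terms of Eqs. (8) and (10), we get a recursive equation for $\{a_r\}$:

$$a_r = \frac{a_{r-1}}{4^k},$$

and the solution $a_r$ is

$$a_r = \left(\frac{1}{4^k}\right)^{r-1} a_1. \tag{11}$$

Similarly, the $O(n^{k-1})$ terms yield an equation for $\{b_r\}$:

$$b_r = \frac{b_{r-1}}{4^{k-1}} + \frac{k^2 a_{r-1}}{2 \cdot 4^k} = \frac{b_{r-1}}{4^{k-1}} + \frac{k^2 a_1}{2}\, \frac{1}{4^{k(r-1)}}.$$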
The solution of this equation is given by

$$b_r = \left(\frac{1}{4^{k-1}}\right)^{r-1} b_1 + \frac{k^2 a_1}{6} \left(\frac{1}{4^k}\right)^{r-1} \left(4^{r-1} - 1\right). \tag{12}$$

We note that the general solution of the recursive equation $x_{r+1} = s x_r + t u^r$
($s \neq u$) is given by

$$x_r = s^{r-1} x_1 + t u\, \frac{u^{r-1} - s^{r-1}}{u - s},$$

and in this case we set $s = 1/4^{k-1}$, $t = k^2 a_1/2$, and $u = 1/4^k$.
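The closed forms (11) and (12) can be checked against a direct iteration of the two recursions. The Python sketch below does this in exact rational arithmetic for a few arbitrary choices of $a_1$, $b_1$, and integer $k$; the function names and the sample values are assumptions made for this illustration.

```python
from fractions import Fraction


def iterate_recursions(a1, b1, k, r_max):
    """Iterate a_r = a_{r-1}/4^k and b_r = b_{r-1}/4^(k-1) + k^2 a_{r-1}/(2*4^k).

    Returns two lists whose entry r - 1 holds a_r and b_r, for r = 1..r_max.
    """
    a, b = [Fraction(a1)], [Fraction(b1)]
    for _ in range(r_max - 1):
        a_prev, b_prev = a[-1], b[-1]
        a.append(a_prev / Fraction(4) ** k)
        b.append(b_prev / Fraction(4) ** (k - 1)
                 + Fraction(k * k, 2) * a_prev / Fraction(4) ** k)
    return a, b


def closed_forms(a1, b1, k, r):
    """Evaluate Eqs. (11) and (12) exactly."""
    a_r = Fraction(a1) / Fraction(4) ** (k * (r - 1))
    b_r = (Fraction(b1) / Fraction(4) ** ((k - 1) * (r - 1))
           + Fraction(k * k, 6) * Fraction(a1) * (4 ** (r - 1) - 1)
           / Fraction(4) ** (k * (r - 1)))
    return a_r, b_r


# Arbitrary sample inputs (a1, b1, k) chosen for illustration.
for a1, b1, k in [(1, 0, 1), (1, 0, 2), (3, -2, 3)]:
    a, b = iterate_recursions(a1, b1, k, 6)
    for r in range(1, 7):
        print(k, r, (a[r - 1], b[r - 1]) == closed_forms(a1, b1, k, r))
```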



Substituting Eqs. (11) and (12) into Eq. (10), one can obtain

$$E_n[f(S_{r,n})] = \left(\frac{n}{4^{r-1}}\right)^k \left\{ a_1 + \frac{1}{n} \left( 4^{r-1} b_1 + \frac{4^{r-1} - 1}{6}\, k^2 a_1 \right) \right\} + O(n^{k-2}). \tag{13}$$

Eq. (13) is the asymptotic expansion of $E_n[f(S_{r,n})]$.

A similar formula can be derived for a $p$-variable function $f(S_{1,n}, S_{2,n}, \ldots, S_{p,n})$.
For simplicity, we introduce the notation
$f(S^{(p)}_{r,n}) = f(S_{r,n}, S_{r+1,n}, \ldots, S_{r+p-1,n})$. Assuming the asymptotic form

$$E_n[f(S^{(p)}_{1,n})] = a_1 n^k + b_1 n^{k-1} + O(n^{k-2}), \tag{14}$$

the asymptotic form of $E_n[f(S^{(p)}_{r,n})]$ is expressed as

$$E_n[f(S^{(p)}_{r,n})] = \left(\frac{n}{4^{r-1}}\right)^k \left\{ a_1 + \frac{1}{n} \left( 4^{r-1} b_1 + \frac{4^{r-1} - 1}{6}\, k^2 a_1 \right) \right\} + O(n^{k-2}). \tag{15}$$



4 Generalized topological self-similarity 

Using the asymptotic formulas (13) and (15), we can easily show generalized topological
self-similarity. The asymptotic form of the generalized bifurcation ratio $R^f_{r,n}$ is
calculated as

$$R^f_{r,n} = \frac{E_n[f(S^{(p)}_{r,n})]}{E_n[f(S^{(p)}_{r+1,n})]} = 4^k - \frac{4^{k+r-1}}{2 a_1 n} \left( 6 b_1 + a_1 k^2 \right) + O(n^{-2}) \to 4^k \quad \text{as } n \to \infty. \tag{16}$$

Therefore, on the random binary-tree model, topological self-similarity about $f$ holds in
an asymptotic sense if $f$ has an expansion as in Eq. (7) or (14). Note that the limit value of
$R^f_{r,n}$ depends only on the dominant order $k$ of $f$.
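As a numerical illustration of the simplest case $f(x) = x$ (so that $k = 1$, $a_1 = 1$, $b_1 = 0$), the Python sketch below evaluates the exact bifurcation ratio from recursion (6) and prints it next to the leading terms $4 - 4^r/(2n)$ of Eq. (2). The function names are hypothetical, and the comparison is only a finite-$n$ check, not a proof of the expansion.

```python
from fractions import Fraction
from functools import lru_cache
from math import comb


def catalan(j):
    """j-th Catalan number c_j."""
    return comb(2 * j, j) // (j + 1)


@lru_cache(maxsize=None)
def prob_m(n, m):
    """Probability that a uniform tree of magnitude n has m second-order branches."""
    count = comb(n - 2, n - 2 * m) * 2 ** (n - 2 * m) * catalan(m - 1)
    return Fraction(count, catalan(n - 1))


@lru_cache(maxsize=None)
def mean_branches(r, n):
    """Exact E_n[S_{r,n}] from recursion (6) with f(x) = x."""
    if r == 1:
        return Fraction(n)
    # For n = 1 the range is empty and the mean is 0, as it should be.
    return sum((prob_m(n, m) * mean_branches(r - 1, m)
                for m in range(1, n // 2 + 1)), Fraction(0))


def bifurcation_ratio(r, n):
    """Exact R_{r,n} = E_n[S_{r,n}] / E_n[S_{r+1,n}] (n must be large enough)."""
    return mean_branches(r, n) / mean_branches(r + 1, n)


for n in (50, 100, 200):
    for r in (1, 2):
        exact = float(bifurcation_ratio(r, n))
        leading = 4 - 4 ** r / (2 * n)
        print(n, r, exact, leading)
```

The gap between the two printed columns should shrink roughly like $n^{-2}$ as $n$ grows, in line with the error term in Eq. (2).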
Here, we provide several examples. 
1. We start from $f(n) = n^k$ ($a_1 = 1$, $b_1 = 0$), so that $E_n[f(S_{r,n})] = E_n[S_{r,n}^k]$
is the $k$th moment of $S_{r,n}$. In this case, Eq. (13) is reduced to

$$E_n[S_{r,n}^k] = \left(\frac{n}{4^{r-1}}\right)^k \left( 1 + \frac{4^{r-1} - 1}{6n}\, k^2 \right) + O(n^{k-2}),$$

and the asymptotic form of $R^f_{r,n}$ is

$$R^f_{r,n} = 4^k - \frac{4^{k+r-1} k^2}{2n} + O(n^{-2}).$$

Thus, Eq. (3) is rederived.

2. We next consider the asymptotic property of the variance of $S_{r,n}$. By the definition
$S_{1,n} = n$, $\mathrm{var}(S_{1,n}) = 0$ is easily obtained. The analytical expression of the
variance of $S_{2,n}$ is given by

$$\mathrm{var}(S_{2,n}) = E_n[(S_{2,n} - E_n[S_{2,n}])^2] = \frac{n(n-1)(n-2)(n-3)}{2(2n-3)^2(2n-5)} = \frac{n}{16} - \frac{1}{32} + O(n^{-1}), \tag{17}$$

which has been obtained by Werner [25]. We regard $\mathrm{var}(S_{1,n}) = 0$ as exceptional.
Regarding Eq. (17) as the initial condition of the calculation ($a_1 = \frac{1}{16}$,
$b_1 = -\frac{1}{32}$, and $k = 1$), and noting that this initial condition is given at order 2,
so that Eq. (13) is applied with $r$ replaced by $r - 1$, the asymptotic form of
$\mathrm{var}(S_{r,n})$ is calculated as

$$\mathrm{var}(S_{r,n}) = \frac{n}{4^r} - \frac{1}{48} - \frac{1}{6 \cdot 4^r} + O(n^{-1}).$$

Therefore, for sufficiently large $n$, the variance $\mathrm{var}(S_{r,n})$ decreases almost
exponentially with an increase of $r$.



3. We next deal with a two-variable function $f(S_{1,n}, S_{2,n}) = S_{2,n}/S_{1,n}$
($= S_{2,n}/n$). According to the result

$$E_n[S_{2,n}] = \frac{n(n-1)}{2(2n-3)}$$

obtained by Werner [25], the initial condition (14) in this case is calculated as

$$E_n\!\left[\frac{S_{2,n}}{S_{1,n}}\right] = \frac{E_n[S_{2,n}]}{n} = \frac{n-1}{2(2n-3)} = \frac{1}{4} + \frac{1}{8n} + O(n^{-2}).$$

Thus, we have $a_1 = \frac{1}{4}$, $b_1 = \frac{1}{8}$, and $k = 0$, and Eq. (15) yields

$$E_n\!\left[\frac{S_{r+1,n}}{S_{r,n}}\right] = \frac{1}{4} + \frac{4^{r-2}}{2n} + O(n^{-2}).$$

On the other hand, by using Eq. (2),

$$\frac{E_n[S_{r+1,n}]}{E_n[S_{r,n}]} = \frac{1}{R_{r,n}} = \frac{1}{4} + \frac{4^{r-2}}{2n} + O(n^{-2}).$$

In conclusion, we obtain

$$E_n\!\left[\frac{S_{r+1,n}}{S_{r,n}}\right] = \frac{E_n[S_{r+1,n}]}{E_n[S_{r,n}]}$$

if $O(n^{-2})$ terms are neglected. This relation is quite simple in appearance, but it is
nontrivial.
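The initial condition used in the second example, Eq. (17), can be checked independently of the asymptotic analysis. The Python sketch below computes the variance of $S_{2,n}$ directly from the probability of finding $m$ second-order branches (the value quoted from Ref. [24] after Eq. (6)) and compares it with the closed form in Eq. (17). The function names are hypothetical.

```python
from fractions import Fraction
from math import comb


def catalan(j):
    """j-th Catalan number c_j."""
    return comb(2 * j, j) // (j + 1)


def prob_m(n, m):
    """Probability that a uniform tree of magnitude n has m second-order branches."""
    count = comb(n - 2, n - 2 * m) * 2 ** (n - 2 * m) * catalan(m - 1)
    return Fraction(count, catalan(n - 1))


def variance_s2(n):
    """Exact var(S_{2,n}) computed from the distribution of S_{2,n}."""
    ms = range(1, n // 2 + 1)
    mean = sum(prob_m(n, m) * m for m in ms)
    second_moment = sum(prob_m(n, m) * m * m for m in ms)
    return second_moment - mean ** 2


def variance_closed(n):
    """Closed form quoted in Eq. (17)."""
    return Fraction(n * (n - 1) * (n - 2) * (n - 3),
                    2 * (2 * n - 3) ** 2 * (2 * n - 5))


for n in (5, 10, 100, 400):
    exact = variance_s2(n)
    print(n, exact == variance_closed(n), float(exact), n / 16 - 1 / 32)
```

The last two printed columns compare the exact variance with the leading terms $n/16 - 1/32$ of Eq. (17).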



5 Discussion 

The random binary-tree model is a highly simplified model, and it may seem unphysical in
the sense that it is not directly related to actual patterns. However, actual branching patterns
are usually affected by stochastic effects, and such randomness is also incorporated in the
random model. An advantage of the random binary-tree model is that analytical calculations can
be carried out extensively, and that some of these calculations explain properties of actual
patterns. Thus, the random model is important as a prototype of branching systems. In addition,
the random model is significant from the viewpoint of statistical mechanics. A branching system
with fluctuations can be regarded as a statistical ensemble, and each binary tree in $\Omega_n$
represents a microscopic state. From this point of view, the random model $(\Omega_n, P)$ is
regarded as the microcanonical ensemble. (In fact, the uniform measure $P$ corresponds to the
principle of equal weight.) Therefore, the random model is important for the theoretical
foundation of the statistical physics of branching systems.

In the present paper, the asymptotic form of $E_n[f(S_{r,n})]$ is derived and generalized
topological self-similarity is confirmed asymptotically for a wide class of $f$. We only assume
that $f$ has an expression as in Eq. (7), which is a Laurent expansion of $f$ around infinity.
Hence, Eq. (7) is valid if $f$ does not have an essential singularity at infinity. Polynomial and
rational functions are typical examples of such functions, and as shown in the previous section,
important random variables in the Horton-Strahler analysis are mostly polynomial or rational
functions of $S_{r,n}$.

We proved topological self-similarity about $f$ on the random model, and we expect that such
generalized topological self-similarity is also valid for some actual branching patterns. For
example, topologically self-similar patterns, such as river networks, are expected to be
topologically self-similar about $f$ for some class of $f$. Further observational, experimental,
and numerical research is needed to settle this question.

The random binary-tree model is a graph-theoretic model, where geometrical properties are
all neglected. As a refinement of the Horton-Strahler analysis, ramification analysis [13-16]
describes how many side-branches emerge. Ramification analysis is still a topological model,
but it is related to fractal structure. Based on the methods and results in this paper, we expect
that some asymptotic properties of random variables in ramification analysis can be obtained, and
that a deeper understanding of the connection between topological self-similarity and
original self-similarity can be gained.

6 Conclusion 

We have first introduced the transformation $\Phi_n$ in Sec. 2 and obtained the recursive
equation (6). Eq. (6) can be solved asymptotically if $f(n)$ is expressed as in Eq. (7). The
solution (13) is the asymptotic form of $E_n[f(S_{r,n})]$. A similar result (15) is derived for a
multivariable function. Topological self-similarity about $f$ is confirmed in Eq. (16). We have
also presented some calculations as examples.